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Abstract 

Typed feature structures are used extensively for the specification of linguistic infor- 
mation in many formalisms. The subsumption relation orders TFSs by their information 
content. We prove that subsumption of acyclic TFSs is well-founded, whereas in the 
presence of cycles general TPS subsumption is not well-founded. We show an application 
of this result for parsing, where the well-foundedness of subsumption is used to guarantee 
termination for grammars that are off-line parsable. We define a new version of off-line 
parsability that is less strict than the existing one; thus termination is guaranteed for 
parsing with a larger set of grammars. 
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1 Introduction 



Feature structures serve as a means for the specification of linguistic information in current 



linguistic formalisms such as LFG ( Kaplan and Bresnan, 1982 ), HPSG ( Pollard and Sag 



1994 ) or (some versions of) Categorial Grammar (paddock, Klein, and Morill, 198^ ). This 
paper focuses on typed feature structures (TFSs) that are a generalization of their untyped 
counterparts. TFSs are related by subsumption (see ( Carpenter, 1992b| )) according to their 
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information content. We show that the subsumption relation is well-founded for acyclic TFSs, 
but not for cyclic ones. We use this result to prove that parsing is terminating for grammars 
that are off-line parsable: this proposition is cited, but not proved, in ([Shieber, 1992| ). We 
also suggest a less strict definition for off'-line parsability that guarantees termination in the 
case of acyclic TFSs. 

This work has originated out of our interest in the theoretical aspects of parsing with 
grammars that are based on TFSs (see ( Wintner and Francez, 1995b| )). While the results 



presented here are basically theoretical, we have implemented a system for efficient processing 
of such grammars, based on abstract machine techniques; this work is presented in ( [Wintnei 



and Francez, 1995aD . The rest of this paper is organized as follows: section |^ outlines the 



theory of TFSs of ( [Carpenter, 1992b| ). In section |3| we discuss the well-foundedness of TFS 
subsumption. We sketch a theory of parsing in section |^ and discuss off-line parsability of 
TFS-based grammars in section |5[ 

2 Theory of feature structures 



This section summarizes some preliminary notions along the lines of ( parpenter, 1992b ) 



While we use the terminology of typed feature structures, all the results are valid for untyped 
structures, that are a special case of TFSs. ( Carpenter, 1992b| ) defines well-typed and totally 



well-typed feature structures that are subsets of the set of TFSs; for generality, we assume 
nothing about the well-typedness of TFSs below. However, the larger context of our work is 
done in a setup where features are assigned to types through an appropriateness specification, 
and hence we retain the term typed feature structures rather than sorted ones. For the 
following discussion we fix non-empty, finite, disjoint sets Types and Feats of types and 
features, respectively, and an infinite set Nodes of nodes, disjoint of Types and Feats, 
each member of which is assigned a type from Types through a fixed typing function 6 : 
Nodes Types. The set Nodes is 'rich' in the sense that for every t G Types, the set 
{q G Nodes | 9{q) = t} is infinite. 

Below, the meta-variable T ranges over subsets of Types, t ~ over types, / - over features 
and q - over nodes. For a partial function F, 'F(x)J,' means that F is defined for the value 
X. Whenever the result of an application of a partial function is used as an operand, it is 
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meant that the function is defined for its arguments. IN denotes the set of natural numbers. 

A partial order C over Types is a type hierarchy if it is bounded complete, i.e., if every 
up-bounded subset T of Types has a (unique) least upper bound, UT. If t Q t' , t is said to 
be more general than t' , which is more specific than t. t' is a subtype of t. _L = \J(j) is the 
most general type; T = UTypes is the most specific, inconsistent type. All occurrences of T 
are identified. 

A feature structure (over the parameters Nodes, Types and Feats) is a directed, 
connected, labeled graph consisting of a finite, nonempty set of nodes Q Q Nodes, a root 
q Q, and a partial function 5 : Q x Feats Q specifying the arcs, such that every node 
q Q is accessible from q. A, B (with or without subscripts) range over feature structures 
and Q,q,6 (with the same subscripts) over their constituents. [| Let TFSs be the set of all 
typed feature structures (over the fixed parameters Nodes, Types and Feats). 

A path is a finite sequence of features, and the set Paths = Feats* is the collection of 
paths, e is the empty path, vr, a (with or without subscripts) range over paths. The definition 
of 5 is extended to paths in the natural way: 5{q,e) = q and 5{q, fir) = 5{6{q, f),'ir). The 
paths of a feature structure A are n(^) = {vr | vr € PATHS and 6A{qA,'^)i}- Note that for 
every TFS A, Il{A) ^ (j) since e € n(A) for every A. 

A feature structure A = {Q,q,6) is cycUc if there exist a non-empty path a € Paths 
and a node q £ Q such that 6{q, a) = q. It is acyclic otherwise. Let ATFSs be the set of all 
acyclic TFSs (over the fixed parameters). A feature structure A = (Q,q,6) is reentrant iff 
there exist two different paths iri,ir2 G n(^) such that 5(q, vri) = 5{q,TT2). 

Definition 2.1 (Subsumption) Ai = {Qi,qi,Si) subsumes A2 = {Q2,Q2,^2) iff there 
exists a total function h : Qi ^ Q2 (a subsumption morphismj such that 

• Hqi) = q2 

• for every q E Qi, 9{q) C 6{h{q)) 

• for every q £ Qi and for every f such that 6i{q,f)i, h{5i{q, f)) = 62{h{q),f) 
^Untyped feature structures can be modeled by TFSs: consider a particular type hierarchy in which the set 

of types is the set of atoms, plus the types complex and _L, 1. subsumes every other type, and the rest of the 
types are incomparable. All features are appropriate for the type complex only, with _L as their appropriate 
value. Atomic nodes are labeled by an atom, non-atomic nodes - by complex and variables - by _L. 
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The symbol 'C' is overloaded to denote subsumption (in addition to the subtype relation). 

The morphism h associates with every node in Qi a node in Q2 with at least as specific a 
type; moreover, if an arc labeled / connects q with q' in Ai, then such an arc connects h{q) 
with h{q') in A2- Two properties follow directly from the definition: If A Q B then every 
path defined in A is defined in B, and if two paths are reentrant in A they are reentrant in 
B. 

If two feature structures subsume each other then they have exactly the same structure. 
The only thing that distinguishes between them is the identity of the nodes. This information 
is usually irrelevant, and thus an isomorphism is defined over TFSs as follows: A and B are 
alphabetic variants (denoted A B) iS A Q B and B Q A. A strictly subsumes B 
{An B) iS AQ B and A'/' B. 

If A strictly subsumes B then one of following cases must hold: either B contains paths 
that A doesn't; or there is a path in B that ends in a node with a type that is greater than 
its counterpart in A; or B contains 'more reentrancies': paths that lead to the same node in 
B lead to different nodes in A. 

Lemma 2.2 If A \Z B (through the subsumption morphism h) then at least one of the fol- 
lowing conditions holds: 

1. There exists a path vr G n(i?) \ Ii{A) 

2. There exists a node q € Qa such that 6{q) IZ 9{h{q)), 

3. There exist paths vri,7r2 G n(^) such that SAiqA,"^!) 7^ Sa{qa,'^2) but 5B(gB,vri) = 
SB{qB,-^2)- 



3 Well-foundedness 

In this section we discuss the well-foundedness of TFS subsumption. A partial order :^ on a set 
D is well-founded iff there does not exist an infinite decreasing sequence do y di y d2 y ■ ■ ■ 
of elements of D. We prove that subsumption of acyclic TFSs is well-founded, and show an 
example of general (cyclic) TFSs for which subsumption is not well-founded. While these 



results are not surprising, and in fact might be deduced from works such as, e.g., (Moshiei 
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and Rounds, 1987) or ( phieber, 19921 ), they were not, to the best of our knowledge, spehed 
out exphcitly before. 

Lemma 3.1 A TFS A is acyclic iffIl{A) is finite. 

Proof: If A is cychc, there exists a node q Q and a non-empty path a that 5{q, a) = q. 
Since q is accessible, let vr be the path from the root to q: 6{q, it) = q. The infinite set of 
paths {vra* | i > 0} is contained in n(A). 

If A is acyclic then for every non-empty path a € Paths and every q Q, 6{q, a) ^ q. Q 
is finite, and so is Feats, so the out-degree of every node is finite. Therefore the number of 
different paths leaving q is bounded, and hence n(^) is finite. □ 

Definition 3.2 (Rank) Let r : Types ^ IN be a total function such that r{t) < r{t') if 
tnt'. Foranacyclic TFS A, let A{A) = \U{A)\-\Qa\ andletQ{A) = Ettghca) ^(^'(^(g, vr))). 
Let rank : ATFSs IN he defined by rank{A) = \Il{A)\ + e{A) + A(^). 



By lemma 3.1, rank is well defined for acyclic TFSs. A(yl) can be thought of as the 'number 
of reentrancies' in A: every node q G Qa contributes in-degree(g) — 1 to A(^). For every 
acyclic TFS A, A(^) > (clearly Q{A) > and |n(yl)| > 0) and hence rank{A) > 0. 

Lemma 3.3 If A n B and both are acyclic then rank{A) < rank{B). 

Proof: Since AQ B, Ii{A) C Ii{B). Consider the two possible cases: 

• If n(^) = n(5), then 

- |n(^)| = |n(i?)|; 

— Q{A) < Q{B) by the definitions of Q and subsumption; 

— A(j4) < A{B) by the definition of subsumption, since every reentrancy in A is a 
reentrancy in B; 

- By lemma [23, either e(^) < 6(5) (if case (2) holds), or A{A) < A{B) (if case 
(3) holds). 

Hence rank{A) < rank{B). 

• If n{A) C U{B) then 



- |n(^)| < \UiB)\; 

- e{A) < e{B) (as above) 

- it might be the case that \Qa\ < \Qb\- But for every node q G Qb that is not 
the image of any node in Qaj there exists a path vr such that (^^(gBjVr) = q and 
TT n(^). Hence |n(^)| - \Qa\ < |n(B)| - \Qb\. 

Hence rank{A) < rank{B). □ 
Theorem 3.4 Subsumption of acyclic TFSs is well-founded. 



Proof: For every acycHc TFS A, rank{A) G IN. By lemma 3^, if A d B then rank{A) < 
rank{B). If an infinite decreasing sequence of acychc TFSs existed, rank would have mapped 
them to an infinite decreasing sequence in IN , which is a contradiction. Hence subsumption 
is well-founded for acyclic TFSs. □ 

Theorem 3.5 Subsumption of TFSs is not well-founded. 

Proof: Consider the infinite sequence of TFSs Aq,Ai,... depicted graphically in figure |^. 
For every i > 0, Ai Z\ Ai^i: to see that consider the morphism hi that maps g^+i to iji and 
5i+i{q,f) to Si{h{q),f) for every q G Qi+i (i.e., the first i + 1 nodes of ^j+i are mapped to 
the first i + 1 nodes of Ai, and the additional node of Ai^i is mapped to the last node of Ai). 
Clearly, for every i > 0, /ij is a subsumption morphism. Hence, for every i > 0, Ai ^ ^i+i- 
To show strictness, assume a subsumption morphism h' : Qi ^ Qi+i- By definition, 
^'(Qi) = Qi+i- By the third requirement of subsumption (h commuting with 6), the first 
i + 1 nodes in Ai have to be mapped by h' to the first i + 1 nodes in Aj+i. However, if q is 
the i + 1-th node of Ai, then 5i{q) leads back to q, while 5j+i(/i'(g)) leads to the last node 
of Ai^i (the cyclic node), and hence h' does not commute with 6, a contradiction. Hence, 
Ai □ Ai+i. 

Thus, there exists a strictly decreasing infinite sequence of cyclic TFSs and therefore sub- 
sumption is not well-founded. □ 
To conclude this section, note that specification, which is the inverse relation to subsump- 
tion, is not well-founded even when cyclic feature structures are ruled out. This fact can 
easily be seen by considering the sequence of feature structures Bq, Bi, . . ., where Bi consists 



A3 




Figure 1: An infinite decreasing sequence of TFSs 

of i + 1 nodes, the first i of which are labeled t and the last ~ ±, and an /-arc leads from 
every node to its successor. Clearly, Bi a -B^+i for every i > 0, and the sequence is infinite. 
This is true whether or not appropriateness constraints are imposed on the feature structures 
involved. 

In the general case, then, given a feature structure A it might be possible to construct, 
starting from A, both an infinite decreasing sequence of TFSs (by expanding cycles) and an 
infinite increasing sequence (by adding paths). 

4 Parsing 



Parsing is the process of determining whether a given string belongs to the language defined 
by a given grammar, and assigning a structure to the permissible strings. A large variety 
of parsing algorithms exists for various classes of grammars (for a detailed treatment of the 
theory of parsing with grammars that are based on feature structures, refer to ( ^hieber" 



1992; Sikkel, 1995; Wintner and Francez, 1995b| )). We define below a simple algorithm for 
grammars that are based on TFSs, but it must be emphasized that the results presented in 
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this paper are independent of the particular algorithm; they hold for a wide range of different 
algorithms. 

To be able to represent complex linguistic information, such as phrase structure, the 
notion of feature structures is usually extended. There are two different approaches for 
representing phrase structure in feature structures: by adding special, designated features to 
the FSs themselves; or by defining an extended notion of FSs. The first approach is employed 
by HPSG: special features, such as DTRS (daughters), encode trees in TFSs as lists. This 
makes it impossible to directly access a particular daughter. ( Shieber, 1992| ) uses a variant 
of this approach, where a denumerable set of special features, namely 0, 1, ... , are added to 
encode the order of daughters in a tree. In a typed system such as ours, this method would 
necessitate the addition of special types as well; in general, no bound can be placed on the 
number of features and types necessary to state rules (see ( [Carpenter, 1992b| , p. 194)). 

We adopt below the other approach: a new notion of multi-rooted feature structures, 
suggested by ( pikkel, 1993| ) and ( [Wintner and Francez, 1995b ), is being used. 
Definition 4.1 (Multi-rooted structures) A multi-rooted feature structure (MRS) 
is a pair {Q,G) where G = {Q,S) is a finite, directed, labeled graph consisting of a set 
Q C Nodes of nodes and a partial function 6 : Q x Feats — > Q specifying the arcs, and Q is 
an ordered (repetition-free) set of distinguished nodes in Q called roots. G is not necessarily 
connected, but the union of all the nodes reachable from all the roots in Q is required to yield 
exactly Q. The length of a MRS is the number of its roots, \Q\. A denotes the empty MRS 
(where Q = (p since \Q\ = 0). A MRS is cyclic under the same conditions a TFS is. A 
MRS is reentrant if it contains a node that can be reached either from two different roots 
or through two different paths. 

Meta-variables a,p range over MRSs, and 6,Q,Q over their constituents. If a = {Q,G) 
is a MRS and qi is a root in Q then qi naturally induces a feature structure Ai = {Qi,qi, Si), 
where Qi is the set of nodes reachable from qi and 6i = S\q.. Thus a can be viewed as an 
ordered sequence {Ai, . . . , A^) of (not necessarily disjoint) feature structures. We use the two 
views of MRSs interchangeably. 

Not only can nodes be shared by more than one element of the sequence; paths that start 
in one root can reach a different root. In particular, cycles can involve more than one root. 
Still, it is possible to define sub-structures of MRSs by considering only the sub-graph that 
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is accessible from a sub-sequence of the roots. 

Definition 4.2 (Sub-structure) The sub-structure of a = {Ai, . . . , An) , induced by the 
pair and denoted a^'"^ , is {Ai, . . . ,Aj). If i > j, a^"'^ = A. If i = j, we use o"* for o"* "*. 

Definition 4.3 (Subsumption of multi-rooted structures) A MRS a = {Q,G) sub- 
sumes a MRS a' = {Q',G') (denoted by a Q a') if \Q\ = \Q'\ and there exists a total 
function h : Q ^ Q' such that: 

• for every root q.; E Q, h{qi) = q[ 

• for every q G Q, 9{q) ^ 9{h{q)) 

• for every q £ Q and f G Feats, if d{q, f)i then h{6{q, f)) = 6'{h{q),f) 

Many of the properties of TFSs are easily adaptable to MRSs. Let n(cr) = {(vr,i) | vr G 
Paths, q is the i-th root in Qa and Jo-(9, tt) J,}. Then it is easy to show that a Q p then 
n(cr) C n(p) and every reentrancy in o" is a reentrancy in p. Moreover, if a \Z p (strictly) 



then at least one of the three conditions listed in lemma 2^ holds. 

The well-foundedness result of the previous section are easily extended to MRSs as well. 
Let @{a) = J2{n q)en{cT) '"(^(^cr(^) '^))) and A{a) = |n(cr)| — |(5o-|, and the same rank function 
of TFSs can be used to show the well-foundedness of (acyclic) MRSs. The reverse direction 
is immediate: in the presence of cycles, duplicate the example of the previous section k times 
and an infinite decreasing sequence of MRSs of length k is obtained, for any k > 0. For a 
detailed discussion of the properties of MRSs, refer to ( Wintner and Francez, 1995b| ). 



Rules and grammars are defined over an additional parameter, a fixed, finite set Words 
of words (in addition to the parameters Nodes, Feats and Types). The lexicon associates 
with every word w a feature structure Cat{w), its category .0 The categories are assumed to 
be disjoint. The input for the parser, therefore, is a sequence of (disjoint) TFSs rather than 
a string of words. 

Definition 4.4 (Pre-terminals) Let w = wi . . .Wn £ Words*. PTyj{j,k) is defined iff 

^ 1^ j, k < n, in which case it is the MRS {Cat{wj), Cat{wj^i), . . . , Cat{wk))- If j > k then 

PT^{j,k) = \. 

■^For the sake of simplicity, we assume that lexical entries are not ambiguous. In the case of ambiguity, 
Cat{w) is a set of TFSs. While the definitions become more cumbersome, all the results still obtain. 
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If a single word occurs more than once in the input (that is, Wi = wj for i j), its category 
is copied (with remaned nodes) more than once in PT. 



Definition 4.5 (Grammars) A rule is an MRS of length greater than or equal to 1 with 
a designated (first) element, the head of the rule. The rest of the elements form the rule's 
body (which may he empty). A grammar G = (TZ, Ag) is a finite set of rules TZ and a start 
symbol Ag that is a TPS. 

Figure ^ depicts an example grammar (we use AVM notation for this rule; tags such as |T] 
denote reentrancy). The type hierarchy on which the grammar is based is omitted here. 
Initial symbol: 

phrase 

CAT : 

Lexicon: 



word 

CAT : 



SEM : 



Rules: 



agr 

PER : 



sem 

PRE 



phrase 

CAT : 

AGR : 
SEM : 

phrase 

CAT : 

AGR : 
SEM : 



D : [john] 







_L 




word 

CAT : 



0, 



agr 

PER. : 



sem 

PRED : 



sign 

CAT : 

AGR : 
SEM : 



sign 

CAT : 

AGR : 
SEM : 



3rd J 

sg 



CASE : 







sem 

PRE 



word 

CAT : 



1^ nom j 



agr 

PER : 



sem 

PRE 



O [. 



sign 




CAT : 


w 


AGR : 


1 


SEM : 


2 



sem 

PRED 



0. 



(1) 



(2) 



Figure 2: An example grammar 

In what follows we define the notion of derivation (or rewriting) with respect to TFS-based 
grammars. Informally, this relation (denoted ''^') is defined over MRSs such that a ^ p iS 
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p can be obtained from a by successive application of grammar rules. The reader is referred 
to, e.g., ( pikkel, 1993 ; Shieber, Schabes, and Pereira, 1994; Wintner and Francez, 1995b| ) for 



a detailed formulation of this concept for a variety of formalisms. 

To define derivations we first define immediate derivation. Informally, two MRSs A and 
B are related by immediate derivation if there exists some grammar rule p that licenses the 
derivation, p can license a derivation by some MRS R that it subsumes; the head of R must 
be identified with some element i in A, and the body of R must be identified with a sub- 
structure of -B, starting from i. The parts of A prior to and following i remain intact in B. 
Note that R might carry reentrancies from A io B: if a path 7r2 leaving the i-th element of 
A is reentrant with some path vri leaving the a-th element, and -7r2, starting from the head of 
R, is reentrant with vrs in some element h in R, then vri and tt^ are reentrant in B, starting 
from the elements in B that correspond to a and 6, respectively. 

Definition 4.6 A MRS A = {Ai,...Ak) immediately derives a MRS B = {Bi,... B.^) 
( denoted A ^ B) iff there exist a rule p (zTZ of length n and a MRS R^ p, such that: 

• m = k + n — 2 

• R's head is identified with some element i of A: R^ = A^; 

• R's body is identified with a sub-structure of B: R?'""^ = ^«---*+'T'-2 

• The first i — 1 elements of A and B are identical: = J^^■■■^-^■ 

• The last k - i elements of A and B are identical: = Bm-{k-i+i)...m _ 

The reflexive transitive closure of '—>■ ', denoted '-^ is defined as follows: A A A" if A = A" 
or if there exists A' such that A —>■ A' and A' A A" . 



Definition 4.7 A MRS A derives a MRS B = (denoted A-^ B) iff there exist MRSs A', B' 
such that AQA',BQB' and A' ^ B' . 

Immediate derivation is based on the more traditional notion of substituting some symbol 
which constitutes the head of some rule with the body of the rule, leaving the context intact. 
However, as our rules are based on TFSs, the context of the "symbol" to be substituted might 
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be affected by the substitution. To this end we require identity, and not only unifiabihty, of 
the contexts. MRSs related by derivations should be viewed as being "as specific as needed", 
i.e., containing all the information that is added by the rule that licenses the derivation. This 
is also the reason for the weaker conditions on the relation: it allows an MRS A to derive 
an MRS B if there is a sequence of immediate derivations that starts with a sepcification of 
A and ends in a specification of B. 

Figure ^ depicts a derivation of the string "John loves her" with respect to the example 
grammar. The scope of reentrancy tags should be limited to one MRS, but we use the same 
tags across different MRSs to emphasize the flow of information during derivation. 



phrase 

CAT : 



AGR : 







agr 






PER : 


3rd] 




NUM : 


sg] 




sem 








PRED : 


r love 






ARGl : 


J] [john] 




ARG'2 : 


3 [she] _ 





sign 

CAT : 



AGR : 



CASE : 







[ nom ] 



agr 

PER : 



John 



sign 

CAT : 







sem 

PRED : 

ARGl : 
ARG2 : 



love J 

u 

4 



word 

CAT : 



CASE : 







agr 

PER : 
NUM 



sem 

PRED : |~3~| 

John 



John 



word 

CAT : 







agr 

PER : 



sem 

PRED : 

ARGl : 
ARG2 : 

loves 



love J 

31 



word 

CAT : 



CASE : 
agr 
PER : 
NUM : 



3rd J 

sg 



01 



her 



Figure 3: A leftmost derivation 
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Definition 4.8 (Language) The language of a grammar G is L(G) = {w & Words* | 
w = Wi • • • Wn and Ag ^ PT^(1, n)}. 

The derivation example of figure ^ shows that the sentence "John loves her" is in the language 
of the example grammar, since the derivation starts with a TFS that is more specific than 
the initial symbol and ends in a specification of the lexical entries of the sentences' words. 

Parsing is a computational process triggered by some input string of words w = wi ■ ■ ■ Wn 
of length n > 0. For the following discussion we fix a particular grammar G = {TZ, Ag) and a 
particular input string w of length n. A state of the computation consists of a set of items. 

Definition 4.9 (Items) An item is a tuple [i,a,j,k], where i,j € IN, i < j, a is an MRS 
and < k < \a\. an item is active if k < \a\, otherwise it is complete. Items is the 
collection of all items. 

If [i,a,j,k] is an item, a^- -^ is said to span the input from position i + 1 to position j (the 
parsing invariant below motivates this term), a and k can be seen as a representation of 
a dotted rule, or edge: during parsing all generated items are such that a is (possibly more 
specific than) some grammar rule. A; is a position in a, indicating the location of the dot. 
The part of a prior to the dot was already seen; the part following the dot is still expected. 
When the entire body of a is seen, the edge becomes complete. 

A computation amounts to successively generating items; we assume that item generation 
is done through a finite set of deterministic operations that create an item on the basis of 
previously generated (zero or more) items. Also, if an item was generated on the basis of some 
existing items, those items are not used again by the same operation. This assumption is 
realized by an important class of parsing algorithms known as chart parsers. A computation 
is terminating if and when no new items can be generated. A computation is successful if, 
upon termination, a complete item that spans the entire input and contains the initial symbol 
was generated: the final state of the computation should contain the item [0, a, n, 1], where 
A As and n is the input's length. Different algorithms assign different meanings to items, 
and generate them in various orders (see, e.g., ( [Shieber, Schabes, and Pereira, 1994 ; Sikkel 



1993D ). To be as general as possible, we only assume that the following invariant holds: 



Parsing invariant In a computation triggered by w, if an item [i,o", j, k\ is generated then 
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One immediate consequence of the invariant is that for all the items [i, a,j, k] generated when 
parsing w, < i < j < \w\. 

A parsing algorithm is required to be correct: 

Correctness A computation triggered by w is successful iffw€ L{G). 

Although ( Bhieber, 1992|) uses a different notation than ( |Wintner and Francez, 1995b| ), this 



property is proven by both. 

5 Termination of parsing 



It is well-known (see, e.g., ( Pereira and Warren, 1983 ; Johnson, 1988 )) that unification-based 



grammar formalisms are Turing-equivalent, and therefore the parsing problem is undecidable 
in the general case. However, for grammars that satisfy a certain restriction, termination of 
the computation can be guaranteed. We make use of the well-foundedness of subsumption 
(section ^ to prove that parsing is terminating for off-line parsable grammars. 

To assure efficient computation and avoid maintenance of redundant items, many pars- 
ing algorithms employ a mechanism called subsumption check (see, e.g., ( ^hieber, 1992 ; 



Sikkel, 1993D ) to filter out certain generated items. Define a (partial) order over items: 



[h,o-i,ji,ki] ^ [■i2,o'2,j2, ^2] iff h = «2,ji = J2,^i = ^2 and ai E fT2. The subsumption 
filter is realized by preserving an item x only if no item x' such that x' ^ x was generated 
previously. Thus, for all items that span the same substring, only the most general one 
is maintained. (Shieber, 1992; Wintner and Francez, 1995b| ) prove that by admitting the 



subsumption check, the correctness of the computation is preserved. 

Off-line parsability was introduced by ( [Kaplan and Bresnan, 1982 ) and was adopted 



by ( pereira and Warren, 1983D , according to which "A grammar is off-line parsable if its 
context-free skeleton is not infinitely ambiguous". As ( [Johnson, 1988| ) points out, this re- 
striction (which he defines in slightly different terms) "ensures that the number of constituent 
structures that have a given string as their yield is bounded by a computable function of the 



length of that string". The problem with this definition is demonstrated by ( Haas, 1989 ) 
"Not every natural unification grammar has a context-free backbone" . 
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A context-free backbone is inherent in LFG, due to the separation of c-structure from f- 
structure and the exphcit demand that the c-structure be context-free. However, this notion 
is not well-defined in HPSG, where phrase structure is encoded within feature structures 
(indeed, HPSG itself is not well-defined in the formal language sense). Such a backbone 
is certainly missing in Categorial Grammar, as there might be infinitely many categories. 
( Shieber, 1992| ) generalizes the concept of off-line parsability but doesn't prove that parsing 
with off-line parsable grammars is terminating. We use an adaptation of his definition below 
and provide a proof. 

Definition 5.1 (Finite-range decreasing functions) A total function F : D ^ D, where 
D is a partially- ordered set, is finite-range decreasing (FRD) iff the range of F is finite 
and for every d G D, F{d) ^ d. 

Definition 5.2 (Strong off-line parsability) A grammar is strongly off-line parsable 

iff there exists an FRD-function F from MRSs to MRSs (partially ordered by suhsumption) 
such that for every string w and different MRSs a,p such that a p, if a ^ PTw{i + l,j) 
and PTyj{i + then F{a) / F{p). 

Strong off-line parsability guarantees that any particular sub-string of the input can only be 
spanned by a finite number of MRSs: if a grammar is strongly off-line parsable, there can not 
exist an infinite set S of MRSs, such that for some Q < i < j < \w\, s ^ PTyj{i-\-\, j) for every 
s S. If such a set existed, F would have mapped its elements to the set {F{s) \ s G S}. 
This set is infinite since S is infinite and F doesn't map two different items to the same 
image, and thus the finite range assumption on F is contradicted. 

As ( iShieber, 1992| ) points out, "there are non-off-line-parsable grammars for which ter- 
mination holds" . We use below a more general notion of this restriction: we require that F 
produce a different output on a and p only if they are incomparable with respect to subsump- 
tion. We thereby extend the class of grammars for which parsing is guaranteed to terminate 
(although there still remain decidable grammars for which even the weaker restriction doesn't 
hold). 

Definition 5.3 (Weak off-line parsability) A grammar G is weakly off-line parsable 

iff there exists an FRD-function F from MRSs to MRSs (partially ordered by suhsumption) 

15 



such that for every string w and different MRSs a, p such that a ^ p, if a ^ PTyj{i + 1, i), 
p^ PTyj{i + 1, j), (y % p and p%(J, then F{a) ^ F{p). 

Clearly, strong off-line parsability implies weak off-line parsability. However, as we show 
below, the inverse implication does not hold. 

We now prove that weakly off-line parsable grammars guarantee termination of parsing 
in the presence of acyclic MRSs. We prove that if these conditions hold, only a finite number 
of different items can be generated during a computation. The main idea is the following: 
if an infinite number of different items were generated, then an infinite number of different 
items must span the same sub-string of the input (since the input is fixed and finite). By 
the parsing invariant, this would mean that an infinite number of MRSs derive the same 
sub-string of the input. This, in turn, contradicts the weak off-line parsability constraint. 

Theorem 5.4 If G is weakly off-line parsable and MRSs are acyclic then every computation 
terminates. 

Proof: Fix a computation triggered by w of length n. By the consequence of the parsing 
invariant, the indices that determine the span of items are limited (0 < z < j < n), as are 
the dot positions (0 < A; < \a\). It remains to show that for every selection of z, j and A;, 
only a finite number of MRSs are generated. Let x = [i, a, j, k\ be a generated item. Suppose 
another item is generated where only the MRS is different: x' = [i, p, j, k] and a ^ p. If cr C /o, 
x' will not be preserved because of the subsumption test. If p C cr, x can be replaced by 
x' . There is only a finite number of such replacements, since subsumption is well-founded for 
acyclic MRSs. Now suppose cr % p and p%o. By the parsing invariant, a"^-^ PTm{i-\-\.,j') 
and p"^-^ ^ PTwij- + Since G is weakly off-line parsable, F{<j^-^) ^ F{p^ -^). Since 

the range of F is finite, there are only finitely many items with equal span that are pairwise 
incomparable. Since only a finite number of items can be generated and the computation 
uses a finite number of operations, every computation ends within a finite number of steps. 
□ 

The above proof relies on the well-foundedness of subsumption, and indeed termination 
of parsing is not guaranteed by weak off-line parsability for grammars based on cyclic TFSs. 
Obviously, cycles can occur during unification even if the unificands are acyclic. However, 
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it is possible (albeit costly, from a practical point of view) to spot them during parsing. 
Indeed, many implementations of logic programming languages, as well as of unification- 
based grammars (e.g., ALE ( Carpenter, 1992a| )) do not check for cycles. If cyclic TFSs are 
allowed, the more strict notion of strong off-line parsability is needed. Under the strong 
condition the above proof is applicable for the case of non-well-founded subsumption as well. 

To exemplify the difference between strong and weak off-line parsability, consider a gram- 
mar G that contains the following single rule: 



t 



and the single lexical entry, wi, whose category is: 

t 

Cat{wi) = 

/: ^ 

This lexical entry can be derived by an infinite number of TFSs: 
t 



t 



t 



_L 



Cat{wi) 



t 

It is easy to see that no FRD-function can distinguish (in pairs) among these TFSs, and 
hence the grammar is not strongly off-line parsable. The grammar is, however, weakly off- 
line parsable: since the TFSs that derive each lexical entry form a subsumption chain, the 
antecedent of the implication in the definition for weak off-line parsability never holds; even 
trivial functions such as the function that returns the empty TFS for every input are appro- 
priate FRD-functions. Thus parsing is guaranteed to terminate with this grammar. 

It might be claimed that the example rule is not a part of any grammar for a natural 
language. It is unclear whether the distinction between weak and strong off-line parsability 
is relevant when "natural" grammars are concerned. Still, it is important when the formal, 
mathematical and computational properties of grammars are concerned. We believe that 
a better understanding of formal properties leads to a better understanding of "natural" 
grammars as well. Furthermore, what might be seem un-natural today can be common 
practice in the future. 
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